Markov Model Based Phoneme Class Partitioning for Improved Constrained Iterative Speech Enhancement

Author

  • Levent M. Arslan
Abstract

Research has shown that degrading acoustic background noise influences speech quality across phoneme classes in a non-uniform manner. This results in variable quality performance of many speech enhancement algorithms in noisy environments. A phoneme classification procedure is proposed which directs single-channel constrained speech enhancement. The procedure performs broad phoneme class partitioning of noisy speech frames using a continuous mixture hidden Markov model recognizer in conjunction with a perceptually motivated cost-based decision process. Once noisy speech frames are identified, iterative speech enhancement based on all-pole parameter estimation with inter- and intra-frame spectral constraints is employed. The phoneme class directed enhancement algorithm is evaluated using TIMIT speech data and shown to result in substantial improvement in objective speech quality over a range of signal-to-noise ratios and individual phoneme classes.
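
The cost-based decision step mentioned in the abstract can be illustrated with a minimal sketch. It assumes the recognizer yields a probability for each broad phoneme class per frame, and that the decision picks the class with the lowest expected misclassification cost. The class names, the cost values, and the function name `min_cost_class` are all hypothetical; the abstract does not specify the actual partition or the perceptual cost function, so this is not the author's implementation.

```python
# Hypothetical broad phoneme classes; the abstract does not list the actual partition.
CLASSES = ["vowel", "nasal", "fricative", "stop", "silence"]


def min_cost_class(posteriors, cost):
    """Return the class with the lowest expected misclassification cost.

    posteriors[j]: probability that the frame belongs to class j
                   (assumed here to be normalized recognizer scores).
    cost[i][j]:    illustrative penalty for deciding class i when the
                   true class is j (assumed values, not from the paper).
    """
    expected = [
        sum(cost[i][j] * posteriors[j] for j in range(len(posteriors)))
        for i in range(len(cost))
    ]
    # Index of the smallest expected cost.
    best = min(range(len(expected)), key=expected.__getitem__)
    return CLASSES[best], expected


if __name__ == "__main__":
    n = len(CLASSES)
    frame_posteriors = [0.10, 0.20, 0.40, 0.25, 0.05]

    # Zero-one cost: every error is equally bad, so the rule reduces to
    # picking the most probable class.
    zero_one = [[0 if i == j else 1 for j in range(n)] for i in range(n)]
    print(min_cost_class(frame_posteriors, zero_one)[0])

    # Asymmetric cost: treating a stop as a fricative is assumed to be
    # perceptually expensive, which can change the decision.
    asymmetric = [row[:] for row in zero_one]
    asymmetric[2][3] = 5
    print(min_cost_class(frame_posteriors, asymmetric)[0])
```

With a zero-one cost matrix the rule picks the most probable class. A non-uniform matrix lets perceptually harmful confusions be penalized more heavily, which is one plausible reading of "perceptually motivated" in the abstract.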

Similar resources

Minimum cost based phoneme class detection for improved iterative speech enhancement

It is known that degrading acoustic noise influences speech quality across phoneme classes in a non-uniform manner. This results in variable quality performance for many speech enhancement algorithms in noisy environments. To address this, a hidden-Markov-model phoneme classification procedure is proposed which directs single channel speech enhancement across individual phoneme classes. The proc...

Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM

Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using a deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems: training and testing. Mos...

Allophone-based acoustic modeling for Persian phoneme recognition

Phoneme recognition is one of the fundamental phases of automatic speech recognition. Coarticulation, which refers to the integration of sounds, is one of the important obstacles in phoneme recognition. In other words, each phone is influenced and changed by the characteristics of its neighboring phones, and coarticulation is responsible for most of these changes. The idea of modeling the effects o...

Markov model-based phoneme class partitioning for improved constrained iterative speech enhancement

[17] A. Benyassine and H. Abut, “Mixture excitations and finite-state CELP speech coders,” in Proc. IEEE ICASSP, Mar. 1992, pp. I-345-I-348. P. Kroon and B. S. Atal, “Strategies for improving the performance of CELP coders at low bit rates,” in Proc. IEEE ICASSP, Apr. 1988, pp. 151-154. P. Kroon and B. S. Atal, “On the use of pitch predictors with high temporal resolution,” IEEE Trans. Acoust., S...

Evaluation of a Speech Bandwidth Extension Algorithm Based on Vocal Tract Shape Estimation

In this paper, we evaluate a speech bandwidth extension (BWE) algorithm which involves phonetic and speaker dependent estimation of the high-band part of the spectral envelope. The BWE algorithm extracts speech phoneme information by using a hidden Markov model. Speaker vocal tract shape information corresponding to the wideband signal is extracted by a codebook search. Postprocessing of the es...

Publication date: 1995